SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Mosselman, Sieste

Dijkema, Rein 

(ii) TITLE OF INVENTION: Novel estrogen receptor 

(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Akzo Nobel Patent Dept. 

(B) STREET: 1300 Piccard Drive, Suite 206 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: US

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/826,361 

(B) FILING DATE: 26-MAR-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Gormley, Mary E. 

(B) REGISTRATION NUMBER: 34,409 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 301-948-7400 

(B) TELEFAX: 301-948-9751 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1434 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGTAA CAAGGGCATG 1260

GAACATCTGC TCAACATGAA GTGCAAAAAT GTGGTCCCAG TGTATGACCT GCTGCTGGAG 1320

ATGCTGAATG CCCACGTGCT TCGCGGGTGC AAGTCCTCCA TCACGGGGTC CGAGTGCAGC 1380

CCGGCAGAGG ACAGTAAAAG CAAAGAGGGC TCCCAGAACC CACAGTCTCA GTGA 1434

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTG A 1251



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 66 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp
1 5 10 15

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His
20 25 30

Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn
35 40 45

Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val
50 55 60

Gly Met
65



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Leu Val Leu Thr Leu Leu Glu Ala Glu Pro Pro His Val Leu Ile Ser
1 5 10 15

Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr
20 25 30

Lys Leu Ala Asp Lys Glu Leu Val His Met Ile Ser Trp Ala Lys Lys
35 40 45

Ile Pro Gly Phe Val Glu Leu Ser Leu Phe Asp Gln Val Arg Leu Leu
50 55 60

Glu Ser Cys Trp Met Glu Val Leu Met Met Gly Leu Met Trp Arg Ser
65 70 75 80

Ile Asp His Pro Gly Lys Leu Ile Phe Ala Pro Asp Leu Val Leu Asp
85 90 95

Arg Asp Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Met
100 105 110

Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Leu Gln His Lys
115 120 125

Glu Tyr Leu Cys Val Lys Ala Met Ile Leu Leu Asn Ser Ser Met Tyr
130 135 140

Pro Leu Val Thr Ala Thr Gln Asp Ala Asp Ser Ser Arg Lys Leu Ala
145 150 155 160

His Leu Leu Asn Ala Val Thr Asp Ala Leu Val Trp Val Ile Ala Lys
165 170 175

Ser Gly Ile Ser Ser Gln Gln Gln Ser Met Arg Leu Ala Asn Leu Leu
180 185 190

Met Leu Leu Ser His Val Arg His Ala Ser Asn Lys Gly Met Glu His
195 200 205

Leu Leu Asn Met Lys Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu
210 215 220

Leu Glu Met Leu Asn Ala His Val Leu
225 230

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Ser
405 410 415

Asn Lys Gly Met Glu His Leu Leu Asn Met Lys Cys Lys Asn Val Val
420 425 430

Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn Ala His Val Leu Arg
435 440 445

Gly Cys Lys Ser Ser Ile Thr Gly Ser Glu Cys Ser Pro Ala Glu Asp
450 455 460

Ser Lys Ser Lys Glu Gly Ser Gln Asn Pro Gln Ser Gln
465 470 475

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg
405 410 415



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GGIGAYGARG CWTCIGGITG YCAYTAYGG
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
AAGCCTGGSA YICKYTTIGC CCAIYTIAT
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGTTACGAAG TGGGAATGGT GA 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTGACACCAG ACCAACTGGT AATG

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGTGGCGACG ACTCCTGGAG CCCG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GTACACTGAT TTGTAGCTGG AC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCATGATGAT GTCCCTGACC
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14
TCGCATGCCT GACGTGGGAC
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GGCSTCCAGC ATCTCCAGSA RCAG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GGAAGCTGGC TCACTTGCTG
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TCTTGTTCTG GACAGGGATG 
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GCATGGAACA TCTGCTCAAC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGCAAGTTCA GCCTGTTAAG T
(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTC TGCCTGA 1257

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg
405 410 415

Ser Ala

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTTGGATCCA TAGCCCTGCT GTGATGAATT ACAG 
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GATGGATCCT CACCTCAGGG CCAGGCGTCA CTG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CACGAATCTT TGAGAACATT ATAATGACCT TTGTGCCTCT TCTTGCAAGG TGTTTTCTCA 60
GCTGTTATCT CAAGACATGG ATATAAAAAA CTCACCATCT AGCCTTAATT CTCCTTCCTC 120
CTACAACTGC AGTCAATCCA TCTTACCCCT GGAGCACGGC TCCATATACA TACCTTCCTC 180
CTATGTAGAC AGCCACCATG AATATCCAGC CATGACATTC TATAGCCCTG CTGTGATGAA 240
TTACAGCATT CCCAGCAATG TCACTAACTT GGAAGGTGGG CCTGGTCGGC AGACCACAAG 300
CCCAAATGTG TTGTGGCCAA CACCTGGGCA CCTTTCTCCT TTAGTGGTCC ATCGCCAGTT 360
ATCACATCTG TATGCGGAAC CTCAAAAGAG TCCCTGGTGT GAAGCAAGAT CGCTAGAACA 420
CACCTTACCT GTAAACAGAG AGACACTGAA AAGGAAGGTT AGTGGGAACC GTTGCGCCAG 480
CCCTGTTACT GGTCCAGGTT CAAAGAGGGA TGCTCACTTC TGCGCTGTCT GCAGCGATTA 540
CGCATCGGGA TATCACTATG GAGTCTGGTC GTGTGAAGGA TGTAAGGCCT TTTTTAAAAG 600
AAGCATTCAA GGACATAATG ATTATATTTG TCCAGCTACA AATCAGTGTA CAATCGATAA 660
AAACCGGCGC AAGAGCTGCC AGGCCTGCCG ACTTCGGAAG TGTTACGAAG TGGGAATGGT 720
GAAGTGTGGC TCCCGGAGAG AGAGATGTGG GTACCGCCTT GTGCGGAGAC AGAGAAGTGC 780
CGACGAGCAG CTGCACTGTG CCGGCAAGGC CAAGAGAAGT GGCGGCCACG CGCCCCGAGT 840
GCGGGAGCTG CTGCTGGACG CCCTGAGCCC CGAGCAGCTA GTGCTCACCC TCCTGGAGGC 900
TGAGCCGCCC CATGTGCTGA TCAGCCGCCC CAGTGCGCCC TTCACCGAGG CCTCCATGAT 960
GATGTCCCTG ACCAAGTTGG CCGACAAGGA GTTGGTACAC ATGATCAGCT GGGCCAAGAA 1020
GATTCCCGGC TTTGTGGAGC TCAGCCTGTT CGACCAAGTG CGGCTCTTGG AGAGCTGTTG 1080
GATGGAGGTG TTAATGATGG GGCTGATGTG GCGCTCAATT GACCACCCCG GCAAGCTCAT 1140
CTTTGCTCCA GATCTTGTTC TGGACAGGGA TGAGGGGAAA TGCGTAGAAG GAATTCTGGA 1200
AATCTTTGAC ATGCTCCTGG CAACTACTTC AAGGTTTCGA GAGTTAAAAC TCCAACACAA 1260
AGAATATCTC TGTGTCAAGG CCATGATCCT GCTCAATTCC AGTATGTACC CTCTGGTCAC 1320
AGCGACCCAG GATGCTGACA GCAGCCGGAA GCTGGCTCAC TTGCTGAACG CCGTGACCGA 1380
TGCTTTGGTT TGGGTGATTG CCAAGAGCGG CATCTCCTCC CAGCAGCAAT CCATGCGCCT 1440
GGCTAACCTC CTGATGCTCC TGTCCCACGT CAGGCATGCG AGTAACAAGG GCATGGAACA 1500
TCTGCTCAAC ATGAAGTGCA AAAATGTGGT CCCAGTGTAT GACCTGCTGC TGGAGATGCT 1560
GAATGCCCAC GTGCTTCGCG GGTGCAAGTC CTCCATCACG GGGTCCGAGT GCAGCCCGGC 1620
AGAGGACAGT AAAAGCAAAG AGGGCTCCCA GAACCCACAG TCTCAGTGAC GCCTGGCCCT 1680
GAGGTGAACT GGCCCACAGA GGTCACAAGC TGAAGCGTGA ACTCCAGTGT GTCAGGAGCC 1740
TGGGCTTCAT CTTTCTGCTG TGTGGTCCCT CATTTGGTGA TGGCAGGCTT GGTCATGTAC 1800
CATCCTTCCC TCCACCTTCC CAACTCTCAG GAGTCGGTGT GAGGAAGCCA TAGTTTCCCT 1860
TGTTAGCAGA GGGACATTTG AATCGAGCGT TTCCACAC 1898
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 530 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Asp Ile Lys Asn Ser Pro Ser Ser Leu Asn Ser Pro Ser Ser Tyr
1 5 10 15

Asn Cys Ser Gln Ser Ile Leu Pro Leu Glu His Gly Ser Ile Tyr Ile
20 25 30

Pro Ser Ser Tyr Val Asp Ser His His Glu Tyr Pro Ala Met Thr Phe
35 40 45

Tyr Ser Pro Ala Val Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn
50 55 60

Leu Glu Gly Gly Pro Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp
65 70 75 80

Pro Thr Pro Gly His Leu Ser Pro Leu Val Val His Arg Gln Leu Ser
85 90 95

His Leu Tyr Ala Glu Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser
100 105 110

Leu Glu His Thr Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val
115 120 125

Ser Gly Asn Arg Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg
130 135 140

Asp Ala His Phe Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His
145 150 155 160

Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser
165 170 175

Ile Gln Gly His Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr
180 185 190

Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys
195 200 205

Cys Tyr Glu Val Gly Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys
210 215 220

Gly Tyr Arg Leu Val Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His
225 230 235 240

Cys Ala Gly Lys Ala Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg
245 250 255

Glu Leu Leu Leu Asp Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu
260 265 270

Leu Glu Ala Glu Pro Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro
275 280 285

Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys
290 295 300

Glu Leu Val His Met Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val
305 310 315 320

Glu Leu Ser Leu Phe Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met
325 330 335

Glu Val Leu Met Met Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly
340 345 350

Lys Leu Ile Phe Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys
355 360 365

Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr
370 375 380

Ser Arg Phe Arg Glu Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val
385 390 395 400

Lys Ala Met Ile Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala
405 410 415

Thr Gln Asp Ala Asp Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala
420 425 430

Val Thr Asp Ala Leu Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser
435 440 445

Gln Gln Gln Ser Met Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His
450 455 460

Val Arg His Ala Ser Asn Lys Gly Met Glu His Leu Leu Asn Met Lys
465 470 475 480

Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn
485 490 495

Ala His Val Leu Arg Gly Cys Lys Ser Ser Ile Thr Gly Ser Glu Cys
500 505 510

Ser Pro Ala Glu Asp Ser Lys Ser Lys Glu Gly Ser Gln Asn Pro Gln
515 520 525

Ser Gln
530

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTGCGGATCC TCTCAAGACA TGGATATAAA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27
AGTAACAGGG CTGGCGCAAC GGTTC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
ACTGGCGATG GACCACTAAA GG 



